ACIRD: Intelligent Internet Document Organization and Retrieval

نویسندگان

Shian-Hua Lin

Meng Chang Chen

Jan-Ming Ho

Yueh-Ming Huang

چکیده

This paper presents an intelligent Internet information system, Automatic Classifier for the Internet Resource Discovery (ACIRD), which uses machine learning techniques to organize and retrieve Internet documents. ACIRD consists of a knowledge acquisition process, document classifier and two-phase search engine. The knowledge acquisition process of ACIRD automatically learns classification knowledge from classified Internet documents. The document classifier applies learned classification knowledge to classify newly collected Internet documents into one or more classes. Experimental results indicate that ACIRD performs as well or better than human experts in both knowledge acquisition and document classification. By using the learned classification knowledge and the given class lattice, the ACIRD two-phase search engine responds to user queries with hierarchically structured navigable results (instead of a conventional flat ranked document list), which greatly aids users in locating information from numerous, diversified Internet documents. Index Terms : Document Classification, Data Mining, Information Retrieval, and Search Engine.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ACIRD: Intelligent Internet Documents Organization and Retrieval

In this paper, we present an intelligent Internet information system ACIRD using machine learning techniques to organize and retrieve Internet Web documents. ACIRD consists of three parts: knowledge acquisition, document classifier and two-phase search engine. The knowledge acquisition of ACIRD automatically learns the classification knowledge from classified Internet Web documents and the clas...

متن کامل

ACIRD: An Intelligent Internet Information System Based on Data Mining (Extended Abstract)

The explosive growth of the Internet dramatically changes the way of working and living that the Internet becomes a major source of information. However, the excessive information on the Internet creates the information overflow problem. As a result, information retrieval (IR) systems (or search engines) come to help the Internet users to alleviate the problem. The conventional IR systems are d...

متن کامل

IDSIS: Intelligent Document Semantic Indexing System

System Zhongzhi Shi Bin Wu Qing He Xiujun Gong Shaohui Liu Yi Zheng [email protected] Key Laboratory of Intelligent Information Processing , Institute of Computing Technology ,Chinese Academy of Sciences Abstract: With rapid growth of the Internet, how to get information from this huge information space becomes an even important problem. In this paper, An Intelligence Document Semantic Indexi...

متن کامل

Multiple Word senses and Information Retrieval: An application using thesaurally derived Lexical Chains

The primary objective of this work is to Improve Internet based Information Retrieval. Currently Internet search engines retrieve a heterogeneous collection of documents of varied quality. Whilst many are “relevant” to the search terms used, many others coincidentally contain a matched word. They do not, in other words, have meaningful content. An enabling objective is to develop a "weakly" int...

متن کامل

Intelligent Document Retrieval - Exploiting Markup Structure

We present here because it will be so easy for you to access the internet service. As in this new era, much technology is sophistically offered by connecting to the internet. No any problems to face, just for this day, you can really keep in mind that the book is the best book for you. We offer the best here to read. After deciding how your feeling will be, you can enjoy to visit the link and g...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

IEEE Trans. Knowl. Data Eng.

دوره 14 شماره

صفحات -

تاریخ انتشار 2002

ACIRD: Intelligent Internet Document Organization and Retrieval

نویسندگان

چکیده

منابع مشابه

ACIRD: Intelligent Internet Documents Organization and Retrieval

ACIRD: An Intelligent Internet Information System Based on Data Mining (Extended Abstract)

IDSIS: Intelligent Document Semantic Indexing System

Multiple Word senses and Information Retrieval: An application using thesaurally derived Lexical Chains

Intelligent Document Retrieval - Exploiting Markup Structure

عنوان ژورنال:

اشتراک گذاری